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ABSTRACT 

We investigate the power of the caustic technique for identifying substructures of galaxy clusters 
from optical redshift data alone. The caustic technique is designed to estimate the mass profile of 
galaxy clusters to radii well beyond the virial radius, where dynamical equilibrium does not hold. Two 
by-products of this technique are the identification of the cluster members and the identification of 
the cluster substructures. We test the caustic technique as a substructure detector on two samples 
of 150 mock redshift surveys of clusters; the clusters are extracted from a large cosmological Wbody 
simulation of a ACDM model and have masses of M200 r\j 10^^h ^Mq and M200 ^ 10^^h in 

the two samples. We limit our analysis to substructures identified in the simulation with masses 
larger than With mock redshift surveys with 200 galaxies within 3i?200: (1) the caustic 

technique recovers ^ 30 — 50% of the real substructures, and (2) ^ 15 — 20% of the substructures 
identified by the caustic technique correspond to real substructures of the central cluster, the remaining 
fraction being low-mass substructures, groups or substructures of clusters in the surrounding region, 
or chance alignments of unrelated galaxies. These encouraging results show that the caustic technique 
is a promising approach for investigating the complex dynamics of galaxy clusters. 

Subject headings: galaxies: clusters: general, (cosmology:) large-scale structure of universe, methods: 

N-body simulations 


1. INTRODUCTION 

Galaxy clusters, as the high-mass tail of the hierarchi¬ 
cal structure, connect the large-scale structure to galax¬ 
ies, and are thus relevant for constraining models of 
galaxy evolution, structure formation, and cosmology. 
Due to the large scale and long relaxing time of clusters, 
the presence of substructures is quite common, indicat¬ 
ing that a number of clusters are likely to be out of equi¬ 
librium. Substructures can substantially affect the esti¬ 
mate of the cluster velocity dispersion and mass (Girardi 
et al. 1996; Pinkney et al. 1996), can provide insights 
into the formation process of the cluster, and unveil the 
existence of dark matter (Markevitch et al. 2004; Glowe 
et al. 2006). The presence of substructures appears to 
be a fundamental ingredient of the galaxy-environment 
connection and for shaping the morphology-density rela¬ 
tion (e.g., Fasano et al. 2015; Girardi et al. 2015). The 
mass fraction in substructures can also probe structure 
formation and the expansion rate of the universe (Rich- 
stone et al. 1992; Kauffmann & White 1993; Mohr et al. 
1995; Thomas et al. 1998). 

Attempts to identify and investigate cluster substruc¬ 
tures have been numerous since their first discovery in 
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the optical band (Shane & Wirtanen 1954). In images 
of X-ray surface brightness, substructures are relatively 
obvious, especially with data coming from recent X-ray 
missions, including ROSAT (Kolokotronis et al. 2001; 
Schuecker et al. 2001), Chandra(Jeltema et al. 2005; 
Andrade-Santos et al. 2012; Parekh et al. 2015a), and 
XA4M-iVewton( Zhang et al. 2009). 

However, because the X-ray surface brightness de¬ 
creases rapidly with increasing radius, these observations 
can only trace substructures in the central region of clus¬ 
ters. Substructures can also appear in microwave ob¬ 
servations because free electrons in the hot X-ray gas 
originate the Sunyaev-Zeldovich effect (Komatsu et al. 
2001; Korngut et al. 2011). The location of radio halos 
in clusters also tends to coincide with the substructures 
observed in X-ray images and temperature maps (see Fer- 
etti et al. 2012, for a recent review). 

The existence of substructures in the dark matter ha¬ 
los of clusters can also be revealed by the anomalous im¬ 
ages of strong gravitational lensing systems (Kneib et al. 
1996; Mao & Schneider 1998; Mao et al. 2004), or by pe¬ 
culiar features of the halo density profiles of weak lensing 
systems (Hoekstra et al. 2000; Clowe et al. 2006; Okabe 
et al. 2010; Pastor Mira et al. 2011; Oguri et al. 2013; 
McCleary et al. 2015; Shirasaki 2015), although the con¬ 
tamination by chance alignments of unrelated massive 
systems along the line of sight can be severe (Hoekstra 
2003; Hoekstra et al. 2011; Geller et al. 2013). 

The detection of substructures from optical data, based 
on the galaxy celestial coordinates and redshifts, is still a 
common approach for studying substructures. The meth¬ 
ods can either use galaxy positions alone, redshifts alone, 
or both. 

The methods that use galaxy positions alone include 
the smoothed density-contour maps (Geller & Beers 
1982), symmetry test, angular separations test, density 
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contrast test (West et al. 1988), average two-point cor¬ 
relation function (Salvador-Sole et al. 1993), and two- 
dimensional (2D) wavelet transforms (Slezak et al. 1990; 
Escalera & MacGillivray 1995; Flin & Krywult 2006). 
They usually suffer from the contamination of back¬ 
ground and foreground galaxies, but are clearly very use¬ 
ful when spectroscopic observations are missing or incom¬ 
plete. 

The methods that only use the galaxies’ redshifts usu¬ 
ally assume that the distribution of the velocities of the 
member galaxies is Gaussian. Based on this assumption, 
the indicators like kurtosis, skewness (West & Bothun 
1990; Solanes et al. 1999), and the asymmetry and tail 
indices (Bird & Beers 1993) are designed to quantify the 
subclustering in the one-dimensional (ID) redshift distri¬ 
bution. The ID Kaye’s mixture model (KMM) algorithm 
belongs to the set of Gaussian mixture model (GMM) 
methods: it assesses the presence of substructures by 
estimating the number of optimal partitions of Gaus¬ 
sian distributions (Ashman et al. 1994; Kriessler & Beers 
1997). The DEDIG A method is based on an adaptive 
kernel and identifies specific velocity components (Pisani 
1993); when the chosen kernel is Gaussian, DEDIG A re¬ 
duces to one of the GMM methods. 

Among the methods using both the galaxy positions 
and redshifts, the Dressier & Shectman (DS) method 
(Dressier & Shectman 1988; Solanes et al. 1999; Knebe & 
Muller 2000; Aguerri & Sanchez-Janssen 2010; Dressier 
et al. 2013) is certainly the most widely used. Other 
methods include the three-dimensional (3D) KMM al¬ 
gorithm (Bird 1994; Colless & Dunn 1996; Barmby & 
Huchra 1998), which, in input, requires the number of the 
substructures and an initial guess of their positions; the 
3D wavelet transforms (Escalera & Mazure 1992; Gam- 
bera et al. 1997; Girardi et al. 1997; Pagliaro et al. 1999); 
the 3D version of DEDICA (Pisani 1996; Ramella et al. 
2007); and the hierarchical tree algorithm (Serna & Ger- 
bal 1996; Adami et al. 2005). 

All of these methods mainly focus on the substruc¬ 
ture detection. However, an unambiguous association of 
galaxies to individual substructures, which enables the 
derivation of the substructure properties, like size, ve¬ 
locity dispersion, and mass, is not yet available. Here 
we explore the possibility that the caustic technique can 
contribute a step forward in this direction. 

The caustic technique (Diaferio & Geller 1997; Diafe- 
rio 1999, 2009) estimates the escape velocity of cluster 
galaxies from the cluster center to a few times the virial 
radius. The technique name derives from the two curves 
in the redshift diagram where the galaxy number density 
is expected to be infinite in the spherical collapse model 
(Regos & Geller 1989). With this technique, we can es¬ 
timate the mass and gravitational potential profiles of 
galaxy clusters to radii that extend to the cluster infall 
region (see reviews in Diaferio 2009; Serra et al. 2011). 
For the mass estimation, the caustic technique only as¬ 
sumes spherical symmetry and does not require the sys¬ 
tem to be in dynamical equilibrium. It can be used both 
in the central and in the outer regions of clusters, where 
other techniques cannot be applied. 

The first step of the caustic technique procedure is to 
arrange the galaxies in a binary tree according to a pro¬ 
jected pairwise galaxy binding energy. This step is simi¬ 
lar to the procedure described in Serna & Gerbal (1996). 


However, the caustic technique goes further and identifies 
a threshold that cuts the tree and identifies the cluster 
members. Serra & Diaferio (2013) showed that this ap¬ 
proach, combined with the location of the caustics in the 
cluster redshift diagram, returns a list of cluster mem¬ 
bers within R 200 that is 96% complete and only 2% of 
the members are actually interlopers. Within the larger 
radius 3 R 200 : where no other method is available, the 
completeness is 95% and the contamination is 8%. 

The same principle used to cut the binary tree and 
identify the cluster members provides a second threshold 
that gives a list of cluster substructures. So far, no sys¬ 
tematic analysis of the properties of these substructures 
has been performed. Here, we use Wbody simulations to 
investigate the power of the caustic technique to identify 
cluster substructures. 

In Section 2, we describe the cosmological Wbody sim¬ 
ulation and the mock cluster redshift surveys we use to 
test the caustic technique as a substructure detector. We 
briefly review the caustic technique in Section 3. We 
present our results in Section 4, and we discuss them in 
Section 5. 

2. THE SIMULATED CLUSTER SAMPLES 

We use the Goupled Dark Energy Gosmological Sim¬ 
ulations (Baldi 2012). This is the largest set to date of 
Wbody simulations that model the interaction between 
the dark energy scalar field and the Gold Dark Mat¬ 
ter (GDM) fluid. Here, however, we only consider the 
simulation of the standard ACDM model with fiducial 
WMAP7 parameters. The simulated volume is a comov¬ 
ing cube of 1 h~^ Gpc on a side {h = Hq/IOO km s~^ 
Mpc“^ is the dimensionless Hubble constant), containing 
1024^ GDM particles with a mass of 5.84 x lO^^h“^M 0 
and the same number of baryonic particles with a mass 
of 1.17 X 10^^h ^Mq. We only consider the dark matter 
particles: we assume that in the real universe galaxies are 
unbiased tracers of the velocity field of the dark matter 
particles. In fact, both Wbody simulations (e.g., Di¬ 
aferio et al. 2001; Diemand et al. 2004; Gill et al. 2004, 
2005) and observations (e.g., Rines et al. 2008) indicate 
that any velocity bias between galaxies and dark matter 
is smaller than 10%. 

Halos are identified with the Friends-of-Friends (FoF) 
aigorithm (Huchra & Geiier 1982; Davis et ai. 1985), 
which iinks particies with distances iess than the link¬ 
ing length /poF to form a group. We adopt the standard 
linking length /poF = 0.2/mean, with /mean as the mean in¬ 
terparticle separation, corresponding to the overdensity 
at viralization p/Ph = 185 (Audit et al. 1998), with pb as 
the mean background density. In this procedure, the FoF 
halos are identified by using the GDM particies as pri¬ 
mary tracers and then iinking baryonic particies to the 
group of their ciosest GDM neighbor. The characteristic 
radius of the FoF haios, R 200 , is the radius within which 
the average density (inciuding both GDM and baryonic 
particies) is 200 times the criticai density. The mass 
within R 200 is M2oo- 

We consider two sampies of 50 FoF haios, each at red¬ 
shift 2 : = 0: a massive sampie (M15 hereafter), with M 200 
ranging from 0.86 x to 3.4 x and 

median 1.1 x 10^^and a iess massive sampie (M14 
hereafter), with masses ranging from 0.95 x 
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to 1.1 X and median 1.0 x IO^^/i^^Mq. 

For each cluster, we compile three mock galaxy redshift 
catalogs. Each cluster is located at the center of the vol¬ 
ume using the periodic boundary conditions of the sim¬ 
ulation box. We assign the celestial coordinates {a, S) = 
(6^,0°) and a redshift distance cz = 36,000 km s“^ to 
the cluster center. Around the cluster, we consider a rect¬ 
angular prism enclosing the volume, corresponding to a 
solid angle that at the cluster distance covers a square 
area 12h~^ Mpc wide. The volume is centered at the 
cluster and it is 140h“^ Mpc deep. The resulting field 
of view (FOV) is 1°.6 x 1°.6. For each cluster, we apply 
this procedure to three orthogonal directions. Since the 
clusters are generally not spherically symmetric, for our 
statistical purposes we can consider these three mock cat¬ 
alogs as independent clusters. We thus obtain 150 mock 
redshift catalogs for each of the samples (M15 and M14). 

The observational volumes we extract from the simu¬ 
lation typically contain ^ 8 x 10^ particles for the M15 
sample and ^ 5 x 10^ particles for the M14 sample. Real¬ 
istic numbers of observable galaxies in these volumes are 
clearly much smaller. Therefore, we randomly sample 
the dark matter particles until we obtain a given num¬ 
ber of particles N^r within 3/^200- To explore the effect 
of galaxy sampling, we build catalogs with N^r = (100, 
200, 300, 400, and 500). Additionally, we only retain 
particles within ±4000 km s“^ from the cluster center. 
These mock galaxy redshift surveys of cluster regions are 
roughly comparable to recent large surveys of clusters 
and their surroundings, such as CIRS (Rines & Diaferio 
2006) and HeCS (Rines et al. 2013). 

A different strategy to build mock redshift surveys 
could be to keep the number of particles in the FOV 
for both the M14 and M15 samples fixed. However, this 
procedure returns mock surveys where the substructures 
of the M14 clusters are poorly sampled or, more often, 
not sampled at all, because as mentioned above the ob¬ 
servational volumes of the M14 sample are on average 
60% (5 X 10^/8 X 10^) less populated than the M15 sam¬ 
ple volumes. On the contrary, keeping N^r fixed guar¬ 
antees that we properly sample the substructures and 
guarantees that it is closer to the observational proce¬ 
dure of surveys dedicated to the study of the dynam¬ 
ics of clusters; these surveys tend to sample the volume 
more densely around the cluster, both on the sky and in 
redshift space. The very different final numbers of par¬ 
ticles in the FOV of the M14 and M15 samples (Table 
1) reflects a real effect: identifying substructures in less 
massive clusters requires denser surveys because in these 
clusters the probability of measuring the redshift of a 
galaxy that does not belong to the cluster is larger. 

For a given N^r, the total numbers of particles N in 
the FOV of a mock cluster depend on the cluster and 
its surrounding region. To investigate the effects of the 
fluctuations caused by random sampling, we repeat the 
procedure 10 times. Table 1 lists the medians and per¬ 
centile ranges of the number of particles V as a function 
of Nsr: for example, 80% of the mock catalogs of the 
Ml5 sample with N^r = 100 have N in the range of 
185 — 325. For our statistical purposes, these 10 random 
realizations of each individual line-of-sight projection of a 
given cluster with a fixed Nsr cannot be considered inde¬ 
pendent. Hereafter, we will only use these 10 realizations 


TABLE 1 

The number of particles N in the FOV. 


Nsr 

M15 (1500 clusters) 

M14 (1500 clusters) 

10% 

50% 

90% 

10% 

50% 

90% 

100 

185 

241 

325 

672 

996 

1490 

200 

369 

481 

644 

1356 

2014 

2905 

300 

553 

718 

969 

2034 

2988 

4385 

400 

739 

962 

1283 

2704 

4016 

5829 

500 

923 

1195 

1603 

3394 

5034 

7258 


to quantify the fluctuations of the random sampling on 
the cluster samples M15 and M14. Finally, in the M14 
FOV’s N is four to five times larger than in the Ml5 
samples: because the M15 clusters are an order of mag¬ 
nitude more massive than the M14 clusters, whereas the 
surrounding cosmic volumes are comparable, in the M15 
samples the random sampling reaches N^r more rapidly 
and the number of particles sampled in the surrounding 
region is proportionally smaller. 

To identify the substructures of the clusters in the 
simulations, we use the code SUBFIND (Springel et al. 
2001), whose algorithm is based on the overdensity and 
the gravitational binding energy of the particles. More 
specifically, for each FoF halo detected by the FoF algo¬ 
rithm, SUBFIND identifies candidate substructures by sort¬ 
ing the particles of the FoF halo according to their local den¬ 
sity and isolating local density maxima. This procedure pro¬ 
vides substructures whose boundaries are determined by the 
first saddle point identified in the local density field around 
each density maximum. From each substructure, we finally 
remove all particles with positive total energy (see Baldi 
2012, for further details). 

The mass of a substructure is always its total mass, 
namely the sum of the mass of the particles (both CDM 
and baryons) that are gravitationally bound to that sub¬ 
structure as identified by SUBFIND. The first row of Ta¬ 
ble 2 lists the total number of substructures with masses 
larger than 10^^h~^MQ in our two samples M15 and 
M14. The mass threshold is not arbitrary, 

but is a minimum substructure mass set by the num¬ 
ber of luminous galaxies that can be detected in current 
typical surveys. In fact, a substructure is ex¬ 

pected to contain at most a handful of galaxies brighter 
than L^. Hereafter, we will call these substructures 3D 
substructures. 

By randomly sampling the dark matter particles, the 
number of members of a 3D substructure in the mock 
catalog can be substantially reduced or even vanish. We 
only consider 3D substructures that have at least 10 par¬ 
ticles appearing in the FOV. 

Table 2 lists the total number of clusters with at 
least one 3D substructure appearing in the FOV, the 
total number of 3D substructures in all the FOVs Vgub, 
and the ratio between these detectable substructures and 
the total number of substructures (Vgub/Atot); the total 
number of substructures Vtot is listed in the first row of 
Table 2. As expected, the number of 3D substructures 
appearing in the FOV increases with increasing N^r. We 
also list the standard deviations deriving from the ten 
random realizations. We see that the random sampling 
has a moderate impact. In the Ml5 samples, the num¬ 
ber of clusters that do not show 3D substructures in the 
FOV is substantial: if we consider the total members of 
the cluster as the sum of the members of the 3D substruc- 







4 


TABLE 2 

Number of clusters with 3D substructures in the FOV and 

NUMBER OF 3D SUBSTRUCTURES. 


NsR 

M15 

M14 

Nel 

Nsub 

Ratio (%) 

Nci 

Nsub 

Ratio(%) 

- 

150 

594 ^ 

- 

150 

282 ^ 

- 

100 

15 ± 1 

15 ± 2 

2.5 

146 ± 1 

191 ± 2 

67.7 

200 

39 ± 3 

40 ± 3 

6.7 

150 ± 1 

254 ± 4 

90.1 

300 

61 ± 4 

64 ± 4 

10.8 

150 ± 0 

275 ± 3 

97.5 

400 

81 ± 4 

94 ± 5 

15.8 

150 ± 0 

282 ± 1 

100 

500 

99 ± 3 

132 ± 6 

22.2 

150 ± 0 

282 ± 0 

100 


^ -Ntot, total number of 3D substructures with masses larger than 


tures and the members of the cluster core identified by 
SUBFIND, (36 ± 8)% of the total members belong to the 
3D substructures in the M14 samples, whereas this frac¬ 
tion is only (11 ± 6)% in the M15 samples. In addition, 
as mentioned earlier, the M15 FOV’s are 4 to 5 times less 
populated than the M14 fields. Therefore, random sam¬ 
pling makes 3D substructures in the Ml5 samples vanish 
more easily than in the M14 samples. 

3. THE CAUSTIC METHOD 

According to hierarchical clustering models, clusters of 
galaxies form by the aggregation of smaller systems. The 
local gravitational potential plays a crucial role in deter¬ 
mining the galaxy velocities in addition to the radial in¬ 
fall expected in the spherical collapse model (Diaferio & 
Geller 1997). On the redshift diagram of the line-of-sight 
velocity v of the galaxies in the cluster rest frame versus 
their projected distance r from the cluster center, the 
cluster members populate a trumpet-shaped region that 
is approximately symmetric around the r axis (Kaiser 
1987; Regos & Geller 1989; van Haarlem & van de Wey- 
gaert 1993). The caustics define the boundaries of this 
region, whose amplitude A{r) decreases with increasing 
r. A{r) provides the estimate of the escape velocity pro¬ 
file from the cluster and thus its mass profile (Diaferio & 
Geller 1997; Diaferio 1999). 

To measure A{r ), the caustic technique builds a binary 
tree based on the projected galaxy pairwise energy, de¬ 
termines a threshold to cut the binary tree, identifies a 
set of candidate cluster members that in turn determines 
the cluster center and defines the redshift diagram. The 
caustic technique locates the caustics and thus A{r) from 
the galaxy number density on the redshift diagram. The 
steps that are relevant for the identification of the sub¬ 
structures we are interested in here are the construction 
of the binary tree and its threshold determination. For 
the sake of completeness, we list the details of these steps. 
Further details are provided in Diaferio (1999) and Serra 
et al. (2011). 

To build the binary tree, we proceed as follows: 
i. initially each galaxy is a group Qa] 


where Rp is the pair projected separation, H is the 
line-of-sight velocity difference and rrii = rrij = 
10^^ Mq is the typical total mass of a luminous 
galaxy; 

hi. the two groups with the smallest binding energy 
are replaced with a single group and the 
total number of groups is decreased by one; 

iv. the procedure is repeated from step (ii) until only 
one group is left. 

At this stage all the galaxies are arranged in a binary 
tree; an example is shown in Figure 1. This dendro¬ 
gram is derived from a mock catalog of the M14 sam¬ 
ple with N^r = 100 and N = 606. To identify the 
cluster members and its substructures, we need to cut 
the tree at some level. Toward this aim, we identify 
the main branch from the root to the leaves by tracing 
the node that contains the largest number of galaxies at 
each bifurcation. The leaves that hang from each node 
X of the main branch provide a velocity dispersion cr^^g. 
When walking along the main branch from the root to 
the leaves, rapidly decreases due to the progressive 
loss of galaxies that are most likely not associated with 
the cluster (Figure 2); then cr^^g reaches a “a plateau” 
at some node xi. Most of the galaxies hanging from 
this node are members of the cluster: in fact, the system 
is nearly isothermal and moving along the main branch 
toward the leaves removes the less bound galaxies that 
in general do not substantially affect the value of 
When we get close to the leaves along the main branch, 
the remaining galaxies have a binding energy that is very 
small and causes crj^g to drop again. This second rapid 
drop identifies the node X 2 that sets the limit of the a 
plateau. 

To identify the a plateau and its boundaries xi and 
T 2 , Serra et al. (2011) designed an algorithm based on 
the distribution of the velocity dispersions of the nodes, 
as detailed below. 

i. Derive the probability density distribution of the 
velocity dispersion crj^g of the leaves hanging from 
each node; an example is shown in the right panel 
of Figure 2. The mode of this distribution corre¬ 
sponds to the value ap\ of the a plateau. 

ii. To identify the nodes belonging to the a plateau, 
(1) we remove the tails beyond ±0.3crpi of the 
distribution, and (2) the 80% of the remaining 
nodes closest to api are retained as the Ns nodes 
defining the a plateau. 

hi. We choose xi among the first (i.e., closest to the 
root) five nodes of the set of the Ns nodes, as 
the node whose has the smallest discrepancy 
from cTpi; similarly we choose X2 among the last 
five nodes (i.e., furthest away from the root). 


ii. the binding energy E^is = mm{Eij}^ where Eij 
is a projected binding energy between the galaxy 
i ^ Qa and the galaxy j ^ gg, is associated to each 
group pair ga^Qg- The projected binding energy is 
estimated with the relation 


Ei 


= ^ 1 rrurrij 


Rr, 


2 rrij 


rrii 


(1) 


The first node, ti, closest to the root, is the appropri¬ 
ate level for the identification of the cluster. The thresh¬ 
old set by node xi separates the binary tree branches into 
different groups. The group containing the main branch 
is the main group and its galaxies are the candidate mem¬ 
bers of the cluster. The completeness and purity of these 
candidate members have been investigated by Serra & 
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Binary Tree 



Fig. 1. — Dendrogram of the binary tree of a simulated cluster from the M14 samples with 606 particles in the FOV. The particles are 
the leaves of the tree at the bottom of the plot. The thick path highlights the main branch of the tree. The horizontal lines show the 
levels at the two nodes xi (upper line) and X 2 (lower line) that limit the a plateau shown in Figure 2. The upper node xi is the threshold 
that identifies the main group and the surrounding groups, while the lower node X 2 separates the cluster substructures. The groups and 
substructures separated by the two thresholds are depicted with different colors: the main group is in red, two additional groups are in 
blue and green, the recognized core is in yellow, and the two substructures are depicted as blue and green dashed lines. This figure was 
generated with the software CausticApp (Serra and Diaferio, personalpages.to.infn.it/~serra/causticapp.html). 


Diaferio (2013). We consider all of the other groups sep¬ 
arated by the threshold Xi dynamically distinct from the 
cluster, or the main group, and we disregard them here¬ 
after. 

The second node, X 2 , farthest away from the root, iden¬ 
tifies the substructure candidates. We define all the sub¬ 
structures, whose members belong to the main group, as 
the 2D substructures of the cluster. We only consider 2D 
substructures with at least 10 particles. We disregard all 
of the systems separated by the threshold set by node X 2 
whose members do not belong to the main group. As an 
example. Figure 3 shows the distribution on the sky of 
the identified groups and the 2D substructures according 
to the dendrogram and a plateau of Figures 1 and 2. 

4. ANALYSIS AND RESULTS 
4.1. Cluster identification 

Because of the random sampling of particles, some sub¬ 
structures might have less than ten particles in the FOV. 
According to our limit, these substructures are not taken 
into account. If all the 3D substructures vanish from the 
field of view, the cluster has no substructures left. The 


height of the bars of each bin in Figure 4 shows the 
total fraction of clusters that have 3D substructures in 
the FOV. These fractions correspond to the number of 
clusters listed in Table 2. As noted in the previous sec¬ 
tion, almost all of the clusters in the M14 samples show 
3D substructures, whereas many clusters of the M15 sam¬ 
ples have had all of their 3D substructures disappear. 

When constructing the binary tree from the data set 
of a cluster redshift survey, the main group of the binary 
tree might identify a system different from the cluster 
we are interested in because this cluster might not be 
the richest system in the FOV. With real data sets, where 
we usually analyze the clusters individually, we can easily 
correct for this situation by reducing the area of the FOV 
or by imposing the desired cluster center. Here, where 
we analyze large samples of mock clusters automatically 
and blindly, we simply remove these cases. To check 
whether the main group identifies the correct cluster, we 
compare the 2D members, namely the members of the 
main group, with the 3D members of the cluster core 
identified by SUBFIND. We say that a cluster is correctly 
identified if at least 60% of its 3D members are in the list 
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Fig. 2. — Velocity dispersion of the leaves of each node along the 
main branch of the binary tree shown in Figure 1. The vertical 
dashed and solid lines show the nodes xi and X 2 respectively. The 
curve between xi and X 2 is the a plateau, whose position is indi¬ 
cated by the peak of the histogram of node numbers shown in the 
right panel. 

of the 2D members. In Figure 4 the sum of the blue and 
cyan sectors of the bars shows the fraction of correctly 
identified clusters; the red sectors of the bars show the 
fraction of misidentified clusters. 

The fraction of correctly identified clusters is further 
split into the fraction of clusters with 2D substructures, 
shown by the blue sectors, and the fraction of clusters 
with no 2D substructures, shown by the cyan sectors. 
The lack of 2D substructures is usually caused by a rela¬ 
tively low second threshold X 2 that does not leave a suf¬ 
ficiently large number of particles for the substructure 
identification. 

In passing, we note that we confirm the results of Serra 
& Diaferio (2013), who find that on average in clusters 
identified by the a plateau, 13% of the members within 
3i?2oo are actually interlopers (see their Table 1, 7^^ col¬ 
umn, 9^^ row). Therefore, increasing our 60% threshold 
only slightly decreases the fraction of correctly identi¬ 
fied clusters, for example, only by 1% if we increase the 
threshold to 80%. Substructures are more poorly popu¬ 
lated and proportionally more difficult to detect than 
clusters. As we see below, for substructures, a 60% 
threshold turns out to be a reasonable compromise be¬ 
tween the completeness and the success rate: we thus 
also use the 60% threshold for the clusters to adopt a 
single criterion for both structures. 

4.2. 2D versus 3D substructures: The success rate 

To quantify whether the 2D substructures correspond 
to the 3D substructures, we make a one-to-one com¬ 
parison between the members of the 2D substructures 
identified by the binary tree and the members of the 3D 
substructures identified by SUBFIND. A single 2D sub¬ 
structure may contain members belonging to different 
3D substructures or none. We find that in all of the M15 
and M14 samples combined, 49% of the 2D substruc¬ 
tures contain at least one member of a 3D substructure. 
For each of these 2D substructures, we define /sd as the 
largest fraction of its total number of members that are 
also members of a single 3D substructure. Figure 5 shows 


X (/i ‘ Mpc) 
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Fig. 3. — Sky diagrams of the groups (upper panel) and 2D sub¬ 
structures (lower panel) of the cluster whose dendrogram and a 
plateau are shown in Figures 1 and 2. The projected celestial 
coordinates are in radiants; comoving coordinates in the V-body 
simulation are also shown. Particles with the same color belong 
to the same structure identified by the binary tree: clusters and 
groups in the upper panel, and substructures in the lower panel. 
The color code is 
the same as in Figure 1. 

the distributiou of /sd: 51% of the 2D substructures have 
au /sD larger thau 0.8. We adopt fsD = 0.6, a value 
smaller thau the mediau /sd = 0 .8, as the threshold to 
cousider a 3D substructure successfully ideutified by a 2D 
substructure. Adoptiug a smaller threshold iucreases the 
success rate at the expeuse of iucreasiug the discrepaucy 
betweeu the properties of the 2D aud 3D substructures. 
A larger threshold makes the ideutificatiou more solid, 
but substautially drops the success rate. 

It cau happeu that differeut 2D substructures coutaiu 
members of the same 3D substructure. This eveut occurs 
for 12% of the 2D substructures of the M14 samples aud 
for 1.2% of the 2D substructures of the M15 samples. lu 
these cases, we take the 2D substructure coutaiuiug the 
largest uumber of the 3D substructure members as the 
match to the 3D substructure. 

Figure 6 shows a raudom example of the substructure 
ideutificatiou. The cluster has ouly oue 3D substructure, 
whose ceuter is iudicated by the yellow star. The caus¬ 
tic method returus seveu 2D substructures iu additiou 
to the cluster core, iudicated by the yellow square, which 
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Fig. 4.— Fraction of clusters with 3D substructures in the FOV. 
Around each Nsr^ the left (right) bar is for the M15 (M14) sample. 
The blue sectors show the fraction of correctly identified clusters 
with 2D substructures. The cyan sectors show the fraction of cor¬ 
rectly identified clusters without 2D substructures. The red sectors 
show the remaining fraction of clusters that are not on the main 
branch of the binary tree. The error bars show the standard de¬ 
viations of these fractions deriving from the ten random sampling 
realizations. 



Fig. 5. — Distribution of /sd, the largest fraction of the total 
number of members of a 2D substructure that are also members of 
a single 3D substructure. The error bars show the 10% and 90% 
percentile ranges from the ten random sampling realizations. The 
dashed line is the cumulative distribution function. 

is correctly matched. One of the 2D substructures cor¬ 
rectly coincides with the 3D substructure. Out of the 
remaining six 2D substructures that do not correspond 
to any 3D substructure of the cluster identified with SUB¬ 
FIND, two are close to the core and four are relatively 
distant from the cluster center. We consider these six 2D 
substructures to be false detections. 

In fact, in our statistical analysis, we consider as false 
detections all the 2D substructures that do not corre¬ 
spond to the 3D substructures that SUBFIND associates 
with the analyzed cluster. However, our choice is rather 
restrictive. Figure 7 shows the 3D distribution of the 
system shown in Figure 6: out of the six 2D substruc- 


Fig. 6.— Example of the identification of substructures with the 
caustic technique. This cluster from the M14 sample contains 500 
particles within 3i^200 with 4021 dark matter particles in the FOV 
(black dots). Here we only show a fourth of the original FOV 
area around the cluster center indicated by the square. The star 
shows the center of the only 3D substructure with mass larger than 
10^^h~^M q. The colored circles show the members of the seven 
2D substructures and of the cluster core identified with the caustic 
technique. 


tures, only one system is due to chance alignment; the 
remaining five 2D substructures are clearly bound sys¬ 
tems. Two of them, close to the cluster core, are not 
in our list of 3D substructures because they have masses 
smaller than Mq. The remaining three 2D sub¬ 

structures are groups surrounding the cluster center. As 
mentioned above, we do not include them in our list of 
3D substructures in order to restrict our analysis to the 
3D substructures of the analyzed cluster. 

This random example shows that the rate of successful 
detections and the completeness we will show below are 
clearly correct in the context of focusing on the massive 
substructures of individual clusters, but are likely to be 
lower limits to the performance of the identification of 
bound systems from redshift data with the caustic tech¬ 
nique. 



Fig. 7.— The 3D distribution of the system shown in Figure 6. 
To provide the scale and a 3D perspective, we overplot a box with 
dimensions 7 X 7 x 20h~^ Mpc^. The symbols and colors are as in 
Figure 6. 


Figures 6 and 7 suggest that the 2D substructures 
at large projected distances from the cluster center are 

























likely to be groups and substructures of surrounding clus¬ 
ters. We thus compute the success rate, namely the ratio 
between the number of 2D substructures that correspond 
to 3D substructures and the total number of 2D substruc¬ 
tures, as a function of distance from the cluster center. 
Figure 8 shows this relation for the combined M14 and 
Ml5 samples. The success rate peaks at different radii in 
the two samples because the clusters have different sizes. 
When the radii are normalized to R 200 , both peaks ap¬ 
pear in the range ^ 2 — 3i?200- We keep the length in 
this plot in proper units, because these units are more 
convenient with real data; in addition, the mass distri¬ 
bution of each cluster sample is very peaked and the dis¬ 
tance normalization plays a little role within the same 
cluster sample. The success rate shows a broad peak be¬ 
tween ^ 1 and ^ 3h~^ Mpc for the M14 samples and 
between ^ 3 and ^ Mpc for the M15 samples, 

whereas it decreases at the center and in the outskirts of 
the clusters. The low rate at small distances is due to the 
cases where the cluster core is identified as a substructure 
rather than as the core itself. Again, we consider these 
cases to be false detections because we are interested in 
the 3D substructure identification, although these identi¬ 
fied 2D substructures actually are bound systems. In the 
cluster outskirts, the number of 3D substructures clearly 
decreases, unlike the number of 2D substructures. There¬ 
fore, we can introduce a distance criterion by removing 
all of the 2D substructures at distances outside a given 
range that, based on Figure 8, we arbitrarily choose to 
be [0.1,6]h“^ Mpc: this criterion can remove most of 
the many false detections without missing promising 2D 
substructures. 



Fig. 8. — Success rate as a function of the projected distance of 
the 2D substructures from the cluster center. The error bars show 
the 10% and 90% percentile ranges from the ten random sampling 
realizations. 

Figure 9 shows the success rate as a function of 
with the distance criterion applied. The average num¬ 
ber of 2D substructures increases with Nsr^ as expected, 
whereas the success rate decreases from 25% at N^r = 
100 to 15% at Nsr = 500 (these fractions can be read off 
in Figure 9 from the ratio between the length of the blue 
sector of each bar and the total length of the bar): In 
fact, a larger number of particles in the FOV increases 


the sampling of the 3D substructures, but at the same 
time it increases the probability of detecting 2D substruc¬ 
tures by chance alignment. The cyan sectors of the bars 
show the fraction of 2D substructures whose members 
are 3D members of the cluster core rather than members 
of the 3D substructures. The error bars show that the 
random sampling fluctuations have a small impact. 





Fig. 9.— Average number of 2D substructures per cluster as a 
function of N^r: The left (right) bars are for the M15 (M14) sam¬ 
ples. The blue sectors show the 2D substructures that correspond 
to 3D substructures; the cyan sectors show the 2D substructures 
that are actually part of the cluster core; the red sectors show the 
false detections. The error bars show the Icr deviation from the 10 
random sampling realizations. 

We can look at our results from a different perspective. 
Figure 10 shows the distribution of the success rate. The 
y- and x-axes show N^r and the success rate, respec¬ 
tively. The gray scale shows the fraction of clusters on 
this plane. For example, the bottom row of the bottom 
panel shows that 63% of the clusters in the M15 sample 
with Nsr = 100 have no successful detection, whereas in 
34% of the clusters all the 2D substructures correspond 
to real 3D substructures. 

This figure clearly shows that clusters are not uni¬ 
formly distributed on this plane. In fact, the number 
of 2D substructures is discrete and may be small, with 
just one or two 2D substructures, especially when Nsr 
is small. Figure 10 also shows that in all of the Ml5 
samples, about 60% of the clusters have 2D substruc¬ 
tures that are false. This result is due to the fact that 
there are fewer particles belonging to 3D substructures 
in the FOV of massive clusters (see Sect. 2). The lack 
of substructures makes the boundary of the a plateau 
quite ambiguous and, consequently the identification of 
the second threshold X 2 becomes more problematic. On 
the contrary, the success rate in the M14 samples is more 
equally distributed on the plane and the fraction of false 
2D substructures is proportionally smaller. 

4.3. Completeness 

We now estimate the completeness of our samples of 
substructures, namely the ratio between the number of 
correctly identified 3D substructures and the total num¬ 
ber of 3D substructures. We note that the total number 
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Fig. 10. — The distribution of the clusters in the plane N^r 
vs. success rate. The grey scale shows the number of clusters 
normalized to the total number of clusters with 3D substructures 
in the FOV. 

of 3D substructures only includes the 3D substructures 
more massive than 10 ^^and with at least 10 par¬ 
ticles in the FOV. 



Fig. 11.— Completeness of the substructure catalogs. The left 
(right) bars are for the M15 (M14) samples. The blue sectors show 
the average number of 3D substructures properly detected in each 
cluster; the red sectors show the average number of missed 3D sub¬ 
structures in each cluster. The error bars show the Icr fluctuations 
from the ten random sampling realizations. 


The completeness of the individual samples is shown 
in Figure 11. The completeness increases with from 
18% to 29% for the M15 samples, and from 34% to 60% 
for the M14 samples. As for Figure 9, these fractions 
can be read off in Figure 11 from the ratio between the 
length of the blue sector of each bar and the total length 
of the bar. 

Figure 12 shows the completeness as a function of the 
substructure mass for the M14 samples. Clearly, the 
most massive substructures are detected more easily, and 
more substructures are recognized in denser fields. The 
largest 3D substructure mass is 1.19 x lO^^h ^Mq; this 
mass is larger than the maximum M 200 of the M14 sam¬ 
ple, but it still is a factor of 2.2 smaller than the minimum 


total cluster mass 2.66 x Mq, which is defined as 

the sum of the mass of the dark matter and baryonic 
particles of the FoF halo, consistently with the compu¬ 
tation of the mass of the 3D substructures. Figure 12 
shows that we can obviously improve the completeness 
of our substructure sample by increasing the lower mass 
limit. The result is qualitatively similar to the results of 
the M15 samples, which we do not show here, although 
in this case, the trend has larger fluctuations and dis¬ 
continuities due to the limited number of clusters and 
detected substructures. 



Fig. 12.— Completeness vs. 3D substructure mass in the M14 
samples for three different N^r. The error bars show the Icr ffac¬ 
tuations from the ten random sampling realizations. 


To illustrate the origin of the relations plotted in Fig¬ 
ure 12, we show two examples of the substructure mass 
function in Figure 13. The red histogram is the original 
mass function of the 3D substructures in the 150-cluster 
M14 sample, whereas the blue histogram shows the esti¬ 
mated mass function: the original mass function is not 
monotonic and is severely under-represented by the esti¬ 
mated mass function at the low-mass end. 

This analysis of the dependence of the completeness on 
the minimum mass of the 3D substructure unfortunately 
cannot be repeated with the success rate. In fact, the 
substructure mass does not enter our algorithm for the 
identification of substructures from redshift data: chang¬ 
ing the mass threshold would only change the number of 
3D substructures, but it would leave the number of 2D 
substructures unaltered. Therefore, if we increase the 
mass threshold, the success rate simply decreases; if we 
decrease the mass threshold below we start 

probing substructures with one or two bright galaxies at 
most, namely substructures that are virtually impossible 
to detect with current redshift surveys. 

5. DISCUSSION 

5.1. Possible developments of the eaustie teehnique 

In the previous sections we show that the caustic tech¬ 
nique, when blindly applied to redshift surveys of clus¬ 
ters, provides encouraging values of the success rate and 
completeness. In principle, these results could be im¬ 
proved when a cluster is analyzed individually. In fact. 
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— 3D substructures 



Mass {lO^^h ^Mg) 


Fig. 13.— Substructure mass functions. The upper (lower) panel 
is from the M14 sample with N^r = 500 (100). The open his¬ 
tograms show the real 3D substructure mass function, whereas the 
solid histograms show the mass function of the detected 3D sub¬ 
structures. The error bars show the Icr fluctuations from the ten 
random sampling realizations. 

the assumption that the two nodes, xi and X 2 , and their 
corresponding thresholds alone are sufficient to separate 
the groups surrounding the cluster as well as the cluster 
substructures might be too simplistic in some clusters: 
their dynamical state can be complex enough that the 
deviation from the single isothermal sphere is relevant. 
In this case, looking for a single a plateau on the main 
branch might fail to capture the full information on the 
cluster dynamics contained in the binary tree. 

For example, the velocity dispersion of the main group 
can substantially differ from the velocity dispersion of 
the substructures, and in turn different substructures can 
have different velocity dispersions. This situation can 
generate multiple a plateaus on the main branch and 
the determination of the proper thresholds is ambiguous. 
In these cases, the algorithm might not identify the main 
group and its substructures satisfactorily and might miss 
a substantial number of 3D substructures with velocity 
dispersions smaller than the velocity dispersions set by 
the a plateau. 

Figure 14 illustrates this argument: it shows the dis¬ 
tributions of the velocity dispersions of the 3D cores 
(green histograms), 3D substructures (red histograms), 
and the 2D substructures corresponding to 3D substruc¬ 
tures (identified 3D substructures, blue histograms) for 
all of the combined M14 and M15 samples. The velocity 
dispersions of the 3D cores and substructures are com¬ 
puted with the full list of members provided by SUB¬ 
FIND, whereas the velocity dispersion of the identified 
3D substructures are computed from the list of members 
derived from the binary tree. In the M14 samples, the 
velocity dispersions of the cores and of the 3D substruc¬ 
tures substantially overlap. Therefore the velocity dis¬ 
persion corresponding to the single a plateau of the main 
branch of the binary tree can properly identify both the 
main group and the substructures. On the contrary, in 
the M15 samples, the distributions of the velocity disper¬ 
sions of the cores and of the 3D substructures are almost 
completely separated and the a plateau that identifies 
the main branch is unlikely to alo properly identify the 


substructures. Figure 14 thus suggests that this is the 
origin of most of the differences between the substruc¬ 
ture identification results of the M14 and Ml5 samples 
that we describe in the previous sections. 



Velocity Dispersion (km s 0 

Fig. 14.— Distributions of the velocity dispersions of the 3D sub¬ 
structures (red histograms), 3D cluster cores (green histograms), 
and identifled 3D substructures (blue histograms). The upper 
(lower) panel shows the M14 (M15) cluster samples. 

In conclusion, the closer the velocity dispersions of the 
3D substructures are to the velocity dispersion of their 
cores, namely to the a plateau, the more effectively our 
algorithm detects the 3D substructures. When applied 
to clusters individually, as it can happen with real cata¬ 
logs containing fewer clusters, if the shape of the veloc¬ 
ity dispersion on the main branch vs. the node number 
(Figure 2) is more complex than a single a plateau, the 
substructure detection can be improved by tuning the 
substructure threshold according to this shape. We plan 
to investigate this issue in detail in future work. 

As a final note, we emphasize that our analysis is based 
on the assumption that a set of dark matter particles 
randomly sampled from an Wbody simulation is a fair 
representation of a real sample of galaxies. This assump¬ 
tion is expected to be valid at the high-mass end of the 
dark matter halo mass function, but it can become pro¬ 
gressively incorrect at decreasing masses (Sawala et al. 
2015). Therefore, luminosity and velocity segregations 
can partly differentiate the phase-space distributions of 
galaxies from those of an unbiased sample of dark mat¬ 
ter particles. Associating methods, like the abundance 
matching technique (SHAM; Conroy et al. 2006; Vale & 
Ostriker 2006), which assumes a monotonic relationship 
between observed galaxy luminosity and simulated halo 
mass, can be used to make tests on mock catalogs that 
resemble real data sets more closely. In future work we 
will explore this issue: we expect it to be particularly 
relevant in dealing with the different sizes of the Ml5 
and M14 mock surveys (Table 1) and in addressing the 
dependence of the success rate on N^r (Figure 9) in a 
more realistic context. 

5.2. Our results in the eontext of previous work 

Assessing the presence of substructures in clusters has 
been frequently investigated in the literature based on 
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X-ray observations, weak gravitational leasing, or spec¬ 
troscopic redshifts of galaxies. However, the issue has 
been addressed at very different levels. 

X-ray analyses mainly focus on quantifying the surface 
brightness morphology, with the aim of either separat¬ 
ing relaxed clusters from merging clusters (e.g., Parekh 
et al. 2015b) or quantifying the systematic errors affect¬ 
ing cluster mass measurements (Nagai et al. 2007; Pif- 
faretti & Valdarnini 2008). The substructure identifica¬ 
tion is limited to the cluster central region within rsoo, 
which is the typical largest distance where the ICM can 
be detected reliably (Piffaretti & Valdarnini 2008), and 
to substructures that contain a quantity of hot gas large 
enough to produce a detectable X-ray emission. In addi¬ 
tion, the identification of substructures is complicated by 
the fact that viscosity and magnetic fields can displace 
the hot gas from the dominant mass distribution, as in¬ 
dicated by the observations of numerous merging clus¬ 
ters (e.g., Markevitch et al. 2004; Mahdavi et al. 2007; 
Menanteau et al. 2012). 

The mass distribution is expected to be directly de¬ 
tected, with studies based on weak gravitational lens- 
ing, whose signal is not affected by the complications of 
baryonic physics and the dynamical state of the cluster. 
However, unrelated large-scale structures along the line 
of sight (Hoekstra 2003; Hoekstra et al. 2011) and uncer¬ 
tainties on the lens parametrization (Clowe et al. 2004; 
Corless & King 2007) can be the sources of substantial 
systematic errors. For example, when weak leasing is 
applied to the identification of clusters in V-body simu¬ 
lations, projection effects limit the fraction of identified 
clusters that correspond to real clusters to 20%, whereas 
the completeness can be as large as 80% (White et al. 
2002). Shirasaki (2015) proposes a method based on 
weak leasing to identify substructures in the outer re¬ 
gions of clusters, beyond 1 arcmin of the cluster center. 
The method is tested on V-body simulations where sub¬ 
structures are identified with SUBFIND. The fraction of 
real substructures that are identified with this algorithm 
is never larger than 40%, approximately comparable to 
our results, whereas the contamination fraction is not 
mentioned. 

The identification of substructures from the galaxy po¬ 
sitions and redshifts has a long history, but the various 
algorithms have rarely been tested on realistic V-body 
simulations. Escalera & Mazure (1992) test the wavelet 
transform method on 12 toy models where the particles 
of the main cluster are distributed according to a Hubble 
profile, with a Gaussian distribution of their velocities; 
only a single substructure, which is a rescaled model of 
the main cluster, is located away from the cluster core. 
The wavelet transform method yields a significance level 
of the identified substructure and Escalera & Mazure 
(1992) show that for two of their significant substruc¬ 
tures the fraction of particles that do not actually belong 
to the 3D substructures is 3/9 and 0/7. 

Pinkney et al. (1996) investigate 31 different methods, 
including the DS and the KMM algorithms; only 5 out of 
these 31 methods use both galaxy redshifts and celestial 
coordinates. They build two-component merging clus¬ 
ters with V-body simulations and apply the methods at 
different epochs of the merging process and from differ¬ 
ent lines of sight. Therefore, the resulting mock fields of 
view contained a single substructure and no foreground 


and background large-scale structures or field particles. 
The main goal of their test is to check the efficiency of 
the substructure identification and the significance level 
of the detection. The DS method has the best perfor¬ 
mance, with a success rate of 66% atz 5% significant 
level, with a false detection rate of ^ 10% in an iso¬ 
lated isothermal cluster. The success rate of the KMM 
algorithm is around 36%, while the false detection rate 
rapidly increases with the number of particles within the 
mock catalog. 

Systematic tests of the the DEDICA algorithm are per¬ 
formed by Ramella et al. (2007). They build artificial 
clusters with a single substructure away from the cluster 
center and use Poisson noise to simulate the background 
and foreground structures. The cluster and the substruc¬ 
ture are spherically symmetric and have a King density 
profile. On these toy models, most of the time DED¬ 
ICA can detect the substructure and recover 75% of its 
members. 

Unlike the studies described above, we use mock red- 
shift surveys that are extracted from a state-of-the-art 
V-body simulation of a large cosmological volume, con¬ 
taining multiple substructures and foreground and back¬ 
ground structures and filaments, thus providing very re¬ 
alistic mock fields of view. V-body simulations similar to 
ours were adopted by weak leasing analyses (White et al. 
2002; Shirasaki 2015). To match our 2D substructures 
with the 3D substructures, we apply a criterion based on 
the individual particles that are substructure members, 
a very strict but necessary criterion for assessing the ef¬ 
ficiency of identifying the 3D substructures from 2D in¬ 
formation. Our strict criterion applied to realistic mock 
fields of view is the most relevant difference between our 
analysis and previous work. Therefore, comparing our 
results with the results of other techniques by looking at 
their claimed performance alone can be misleading. In 
this realistic context, the ability of our method to iden¬ 
tify roughly 30% to 50% of the genuine substructures 
of the cluster, independently of its dynamical state and 
in the presence of coherent structures along the line of 
sight, is remarkable. It also is important to emphasize 
that our substructure identification method only uses the 
first step of the caustic technique, where the galaxies 
are arranged in a binary tree; this step does not assume 
any spherical symmetry, unlike the unused second part 
of the caustic technique where the redshift diagram and 
the caustic locations are determined. Therefore the sub¬ 
structure identification technique does not assume any 
specific geometry for the substructure. 

Ealse identifications and incompleteness appear to be 
unavoidable in any technique. In future work, we plan 
to investigate some possible improvements, mentioned 
in the previous subsection, that are expected to reduce 
these failures and enhance the performance of the caustic 
technique. 

6. CONCLUSIONS 

We test how efficiently the caustic technique can iden¬ 
tify cluster substructures in mock redshift surveys of 
clusters extracted from Wbody simulations. We con¬ 
sider two samples of 150 clusters each with M 200 ^ 
(M14) and M 200 - lO^^h-^Mo (M15). We 
consider mock redshift surveys with different numbers of 
particles Nsr within 3^200, including = 200, which 
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is the typical size of cluster redshift surveys like CIRS 
(Rines & Diaferio 2006) and HeCS (Rines et al. 2013). 

For the N^r = 200 catalogs, among the 2D substruc¬ 
tures identified by the caustic technique between 0.1 and 
6h~^ Mpc from the cluster center, 17% and 21 % corre¬ 
spond to the real 3D substructures with masses larger 
than lO^^h“^M 0 that are identified in three dimensions, 
for the M14 and M15 samples, respectively. These num¬ 
bers represent a lower limit to the numbers of physically 
bound systems identified with the caustic technique be¬ 
cause we also label as false detections real 3D substruc¬ 
tures that are less massive than 10^^h~^MQ or that be¬ 
long to groups or clusters surrounding the cluster of inter¬ 
est. As for the completeness, the lists of 2 D substructures 
contain 48% (M14) and 29% (M15) of the 3D substruc¬ 
tures that are more massive than 10^^h~^MQ and with 
more than 10 particles in the FOV. 

Our analysis shows that the completeness of the sub¬ 
structure catalog and the successful identification of sub¬ 
structures is a strong function of the substructure mass 
and the density of the survey. However, this latter pa¬ 
rameter does not necessarily need to be as large as pos¬ 
sible, because denser surveys have a larger probability 
of chance alignments and the association of interlopers. 
Quantifying these effects more systematically requires 
further investigation. 

We show that the caustic method appears to be 
a promising technique for identifying substructures of 
galaxy clusters out to their outer regions from redshift 
surveys. When used for this purpose, the caustic tech¬ 
nique does not require the assumption of spherical sym¬ 
metry, and it is thus an ideal tool for analyzing com¬ 
plex systems. However, the method can certainly be im¬ 
proved: the caustic method arranges the galaxies in a 
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